Bayesian statistical effort prediction models for data-centred 4GL software development

Authors

  • C. van Koten
  • A. R. Gray
Abstract

Constructing an accurate effort prediction model is a challenge in Software Engineering. This paper presents three Bayesian statistical software effort prediction models for database-oriented software systems that are developed using a specific 4GL tool suite. The models combine specification-based software size metrics with a metric of the development team's productivity. They are constructed from the subjective knowledge of a human expert and calibrated using empirical data collected from 17 software systems developed in the target environment. The models' predictive accuracy is evaluated using subsets of the same data that were not used for calibration. The results show that the models achieve very good predictive accuracy in terms of the MMRE and pred measures, which confirms that the Bayesian statistical models can predict effort successfully in the target environment. Compared with commonly used multiple linear regression models, the Bayesian statistical models' predictive accuracy is generally equivalent. However, when the number of software systems used for calibration falls below five, the predictive accuracy of the best Bayesian statistical models is significantly better than that of the multiple linear regression model. This result suggests that the Bayesian statistical models would be a better choice when software organizations or practitioners do not possess sufficient empirical data for calibration. The authors expect these findings to encourage more researchers to investigate the use of Bayesian statistical models for predicting software effort.
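The abstract names the MMRE and pred accuracy measures without defining them. The sketch below uses the definitions conventional in the effort-estimation literature: MMRE is the mean of |actual - predicted| / actual over the evaluated systems, and pred(l) is the fraction of systems whose relative error is at most l. The function names and the default level of 0.25 are assumptions made for illustration; the abstract does not state which level the paper uses.

```python
from typing import List, Sequence


def relative_errors(actual: Sequence[float], predicted: Sequence[float]) -> List[float]:
    """Magnitude of relative error (MRE) for each system."""
    if len(actual) != len(predicted) or len(actual) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    if any(a <= 0 for a in actual):
        raise ValueError("actual effort must be positive")
    return [abs(a - p) / a for a, p in zip(actual, predicted)]


def mmre(actual: Sequence[float], predicted: Sequence[float]) -> float:
    """Mean magnitude of relative error; lower is better."""
    errors = relative_errors(actual, predicted)
    return sum(errors) / len(errors)


def pred(actual: Sequence[float], predicted: Sequence[float], level: float = 0.25) -> float:
    """Fraction of systems with MRE <= level; higher is better.

    The default level of 0.25 is an assumed convention, not taken from the paper.
    """
    errors = relative_errors(actual, predicted)
    return sum(1 for e in errors if e <= level) / len(errors)


if __name__ == "__main__":
    # Made-up effort values (e.g. person-hours), for illustration only.
    actual_effort = [100.0, 200.0, 400.0, 50.0]
    predicted_effort = [110.0, 150.0, 400.0, 80.0]
    print("MMRE:", mmre(actual_effort, predicted_effort))
    print("pred(0.25):", pred(actual_effort, predicted_effort))
```

Because both measures divide by the actual effort, they are undefined for systems with zero recorded effort, which is why the sketch rejects non-positive actual values.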

Similar resources

An Effort Prediction Model for Data-Centred Fourth-Generation-Language Software Development

Accurate effort prediction is often an important factor for successful software development. However, the diversity of software development tools observed today has resulted in a situation where existing effort prediction models’ applicability appears to be limited. Data-centred fourth-generation-language (4GL) software development provides one such difficulty. This paper aims to construct an a...

Bayesian Statistical Models for Predicting Software Development Effort

Constructing an accurate effort prediction model is a challenge in Software Engineering. This paper presents new Bayesian statistical models, in order to predict development effort of software systems in the International Software Benchmarking Standards Group (ISBSG) dataset. The first model is a Bayesian linear regression (BR) model and the second model is a Bayesian multivariate normal distri...

A Survey of Bayesian Net Models for Software Development Effort Prediction

This paper discusses recent Bayesian nets built for software development effort prediction. Its aim is to bring closer these models as they may be competitive for other modeling techniques, especially for data-driven machine learning and statistical techniques. Each model has been briefly described and then analyzed in detail in terms of its main purpose, type of structure, data/knowledge base ...

Bayesian Two-Sample Prediction with Progressively Type-II Censored Data for Some Lifetime Models

Prediction on the basis of censored data is a very important topic in many fields, including medical and engineering sciences. In this paper, based on a progressive Type-II right censoring scheme, we will discuss Bayesian two-sample prediction. A general form for the lifetime model, including some well-known and useful models such as Weibull and Pareto, is considered for obtaining prediction bounds ...

A Validation Test Naive Bayesian Classification Algorithm and Probit Regression as Prediction Models for Managerial Overconfidence in Iran's Capital Market

Corporate directors are influenced by overconfidence, which is one of the personality traits of individuals; it may lead to irrational decisions that will have a significant impact on the company's performance in the long run. The purpose of this paper is to validate and compare the Naive Bayesian Classification algorithm and probit regression in the prediction of management's overconfidence at pre...

Journal:
  • Information & Software Technology

Volume 48

Publication date: 2006